Learning Accurate Integer Transformer Machine-Translation Models
نویسندگان
چکیده
We describe a method for training accurate Transformer machine-translation models to run inference using 8-bit integer (INT8) hardware matrix multipliers, as opposed the more costly single-precision floating-point (FP32) hardware. Unlike previous work, which converted only 85 multiplications INT8, leaving 48 out of 133 them in FP32 because unacceptable accuracy loss, we convert all INT8 without compromising accuracy. Tested on newstest2014 English-to-German translation task, our Base and Big yield BLEU scores that are 99.3–100% relative those corresponding models. Our approach converts matrix-multiplication tensors from an existing model into by automatically making range-precision trade-offs during training. To demonstrate robustness this approach, also include results INT6
منابع مشابه
Weighted Transformer Network for Machine Translation
State-of-the-art results on neural machine translation often use attentional sequence-to-sequence models with some form of convolution or recursion. Vaswani et al. (2017) propose a new architecture that avoids recurrence and convolution completely. Instead, it uses only self-attention and feed-forward layers. While the proposed architecture achieves state-of-the-art results on several machine t...
متن کاملSteps Toward Accurate Machine Translation
A novel multi-phase architecture for an accurate machine translation system is proposed. The system is divided into three phases: quick and dirty machine translation (QDMT), conceptual comparison and repair and iterate. QDMT generates the appropriate translation candidates (TCs) in the target language for the input sentence in the source language. Next, the system compares the meaning of the TC...
متن کاملMachine Learning for Integer Programming
Mixed Integer Programs (MIP) are solved exactly by tree-based branch-and-bound search. However, various components of the algorithm involve making decisions that are currently addressed heuristically. Instead, I propose to use machine learning (ML) approaches such as supervised ranking and multi-armed bandits to make better-informed, input-specific decisions during MIP branch-andbound. My thesi...
متن کاملMeasure Transformer Semantics for Bayesian Machine Learning
The Bayesian approach to machine learning amounts to computing posterior distributions of random variables from a probabilistic model of how the variables are related (that is, a prior distribution) and a set of observations of variables. There is a trend in machine learning towards expressing Bayesian models as probabilistic programs. As a foundation for this kind of programming, we propose a ...
متن کاملInteractive Machine Translation using Hierarchical Translation Models
Current automatic machine translation systems are not able to generate error-free translations and human intervention is often required to correct their output. Alternatively, an interactive framework that integrates the human knowledge into the translation process has been presented in previous works. Here, we describe a new interactive machine translation approach that is able to work with ph...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: SN computer science
سال: 2021
ISSN: ['2661-8907', '2662-995X']
DOI: https://doi.org/10.1007/s42979-021-00688-4